A HMM recognition of consonant-vowel syllables from lip contours: the cued speech case

نویسندگان

  • Noureddine Aboutabit
  • Denis Beautemps
  • Jeanne Clarke
  • Laurent Besacier
چکیده

Cued Speech (CS) is a manual code that complements lipreading to enhance speech perception from visual input. The phonetic translation of CS gestures needs to combine the manual CS information with information from the lips, taking into account the desynchronization delay (Attina et al. [1], Aboutabit et al. [2]) between these two flows of information. This paper focuses on HMM recognition of the lip flow for Consonant Vowel (CV) syllables in the French Cued Speech production context. The CV syllables are considered in term of viseme groups that are compatible with the CS system. The HMM modeling is based on parameters derived from both the inner and outer lip contours. The global recognition score of CV syllable reaches 80.3%. This study shows that the errors are mainly observed on consonant groups in the context of high and mid-high rounded vowels. In contrast, CV syllables for anterior non rounded vowels ([ a, , i, , e, ]) and for low and mid-low rounded vowels ([ã , , œ]) are well recognized (in average 87%).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cued Speech automatic recognition in normal-hearing and deaf subjects

This article discusses the automatic recognition of Cued Speech in French based on hidden Markov models (HMMs). Cued Speech is a visual mode which, by using hand shapes in different positions and in combination with lip patterns of speech, makes all the sounds of a spoken language clearly understandable to deaf people. The aim of Cued Speech is to overcome the problems of lipreading and thus en...

متن کامل

A pilot study of temporal organization in Cued Speech production of French syllables: rules for a Cued Speech synthesizer

This study investigated the temporal coordination of the articulators involved in French Cued Speech. Cued Speech is a manual complement to lipreading. It uses handshapes and hand placements to disambiguate series of CV syllables. Hand movements, lip gestures and acoustic data were collected from a speaker certified in manual Cued Speech uttering and coding CV sequences. Experiment I studied ha...

متن کامل

Speaker-independent consonant recognition by integrating discriminant analysis and hmm

In this paper, we propose a new consonant recogmtIOn method which integrates two stochastic method: discriminant analysis and HMM (Hidden Markov Models). Discriminant Analysis is effective to analyze local patterns around the reference-point of a consonant such as a burst point. This method, however, is based on the assumption that the reference-point is detected precisely. HMM is able to extra...

متن کامل

Hand shape Coding for HMM-based Consonant Recognition in Cued Speech for French

Cued Speech (CS) is a visual communication mode that makes use of hand shapes placed in different positions near the face in combination with the natural speech lipreading, to enhance speech perception from visual input. This system is based on the motions of the speaker’s hand moving in close relation with speech. In a CS system, hand shapes are designed to distinguish among consonants and han...

متن کامل

Consonant enhancement effects on speech recognition of hearing-impaired children.

Differences in gain (enhancement, in dB) required to optimize the consonant/vowel intensity ratio in nonsense syllables were determined for stops and fricatives, both voiced and voiceless, in 12 children with congenital moderate to severe sensorineural hearing loss. The test stimuli were vowel/consonant nonsense syllables with various levels of enhancement ranging from 0 dB (for the unprocessed...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007